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ITION OF NON-CD4 MEDIATED HIV INFECTION 



TECHNICAL FIELD OF THE INVENTION 

5 

The present invention is directed to a non-CD4 cell surface receptor for gpl20. 
This gpl20 receptor (gpl20r) has been isolated and cloned and is utilized in the present 
invention in methods and kits for the inhibition and detection of HTV infection. 

10 BACKGROUND OF THE IN VENTION 

Two types of human retroviruses have been identified, leukemia viruses and AIDS- 
related viruses. The primary targets of the human retroviruses are T lymphocytes and cells 
of the central nervous system. All human retroviruses are transmitted by intimate contact, 

15 blood contamination, and infection in utero or after birth by milk. It is likely that all 
human retroviruses originated in Africa and that they encountered the human species via 
interspecies infection, possibly from African green monkeys or a related species. The 
human retroviruses first discovered, Human T Lymphotropic Virus Type 1 (HTLV-1) and 
Human T Lymphotropic Virus Type n (HTLV-II), have a preferential tropism for T4 cells 

20 and some T8 cells, share significant sequence homology, and are mainly associated with T 
cell leukemias and lymphomas. The other group of human retroviruses, generally called 
Human Immunodeficiency Viruses (HIV), is discussed in greater detail below. There are 
two major differences between the two types of human retroviruses: (1) there is substantial 
genomic variability among various HIV isolates, whereas the genomes of HTLV-I and 

25 HTLV-II are stable; and (2) HTV entered human populations much more recently than 
HTLV-I or HTLV-II. 

The human immunodeficiency virus (HIV) is a cytopathic retrovirus and the 
causative agent of the acquired immunodeficiency syndrome (AIDS). Two forms of HIV 
have now been identified. Hie prototype virus, HTV-1, previously termed 

30 lymphadenopathy-associated virus (LAV) and Human T Lymphotropic Virus Type m 
(HTLV-1H), is responsible for the vast majority of reported AIDS cases worldwide. 
Another retrovirus, HTV-2, has been isolated primarily from West African patients with 
AIDS and is pathogenically related to HTV-1 . On the genetic level, HTV-2 is actually more 
closely related to the simian immunodeficiency virus (STV), a retrovirus infecting 

35 monkeys. 

Over half of the people that have contracted AIDS in the United States have already 
died. As many as three million persons in this country may be asymptomatic carriers of 
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HIV and ate capable of transmitting the virus. It had been estimated in 1986 that 270,000 
cases of AIDS will have occurred in the United States by 1991 (U.S. Public Health 
Service, (1986), Public Health Rep. 101:341). The mortality rate from AIDS is 
disturbingly high, exceeding 80% within three years of diagnosis and possibly reaching 
5 100% over a longer period. 

Worldwide, the AIDS epidemic may involve some five to ten million presently 
infected persons. Particularly troublesome are statistics from the African continent where 
millions of individuals are believed infected with HIV, deaths range in the hundreds of 
thousands, and heterosexual transmission predominates. To date, there is neither a known 
10 cure for AIDS nor an effective vaccine against HIV infection. 

Hiv is a member of the nontrarisforming, cytopathic lentivirus family of 
retroviruses. HTV causes a typically fatal disease characterized by severe 
immunodeficiency or neurodegenerative disease, or both. The primary basis for HIV 
induced immunosuppression is the depletion of the helper/inducer subset of T lymphocytes 
15 expressing the CD4 molecule <T4 or CD4 + cells), which serves as a high affinity cell 
surface receptor for the virus. T4 lymphocytes are involved directly or indirectly in the 
induction of nearly every immunologic function in the body, and their depletion results in 
susceptibility to a wide range of opportunistic infections and neoplasms. 

In addition to the T4 lymphocyte, other cells expressing the CD4 molecule are 
targets of HTV infection, especially monocyte-macrophages. HTV infection also results in 
serious B cell abnormalities including polyclonal activation, hypergammaglobulinemia, 
elevated levels of circulating immune complexes, and autoantibodies. A decreased number 
of functional natural killer (NK) cells have also been observed in AIDS patients. 

Infection of CD4 + cells is initiated by the interaction of the CD4 molecule with the 
25 major HTV envelope glycoprotein gpl20, an event which is followed by internalization and 
uncoating of the virion, transcription of genomic UNA to DNA by virus-encoded reverse 
transcriptase, and integration of the resulting proviral DNA into host cell chromosomal 
DNA. Also, unintegrated proviral DNA accumulates in large amounts within infected cells 
and is probably a significant factor in HTV cytopathology (Shaw et al., (1984) Science 
30 226:1165). 

The depletion of CD4 + T cells appears to contribute significantly to the 
immunosuppression associated with AIDS. A primary cytopathic effect of the virus in 
vitro is HIV-induced syncytium formation. CD4, through its interaction with gpl20 plays 
an important role in syncytium formation. However, it has been observed that molecules 
35 on the cell surface of uninfected cells other than CD4 are also involved in HIV-induced 
cell fusion (Hildreth et al. (1989) Science 244:1075-1078). 
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Infection by HTV produces, in addition to AIDS, a set of neuropsychiatric disorders 
which are called the AIDS dementia complex (ADC) (Price et al., (1988)^22:586-592). 

The symptoms of ADC include cognitive impairment, apathy and motor 
dysfunctions, and may affect as many as 90% of AIDS victims. The underlying cause of 
5 ADC appears to be the death of brain cells and HIV-1 can be isolated from the brains of 
infected individuals (Ho et al, (1987) N. Eng. J. Med. 212:278-286). 

An early study suggested that the cellular attachment site for HTV in brain might be 
CD4 (Pert et al., (1986) Proc. Natl. Acad. Sci. USA 52:9254-9258) but attempts to 
replicate these findings were not successful (Kozlowski et al., (1989) Neurosci. Abstr. 
10 15_:671). It now appears unlikely that the CD4 antigen is involved in the infection of 
brain-derived cells by HIV. Susceptibility of brain cells to infection with HIV-1 does not 
correlate with the level of expression of CD4 (Chang-Mayer et al., (1987) Proc. Natl. 
Acad. Sci. USA 84:3526-3530; Srinivasan et al., (1988) Arch. Virol. 25:135-141), and 
infection of brain-derived cells by HIV-1 is not blocked by anti-CD4 antibodies (Clapham 
15 et al., (1989) Nature 222:368-370; Li et al., (1990) I. Virol. 64:1383-1387). 

The present invention demonstrates the presence of a non-CD4 receptor for gpl20 
and a method for the inhibition of HTV infection of cells such as brain and muscle which 
do not express high levels of CD4. 

20 ETTMMARY OF THE INVENTION 

Many cells mat are susceptible to HTV infection appear to bind gpl20 through a 
non-CD4 surface protein. The present invention has identified this non-CD4 gpl20 
receptor (gpl20r) and has recombinantly expressed and characterized gpl20r. 

25 In this invention a specific non-CD4 gpl20r has been isolated which has specific 

binding activity for gpl20 present on Human Immunodeficiency Virus- 1 (HTV). This 
gpl20r has a molecular weight of about 45, 000 daltons, contains about 400 amino acid 
residues and is characterized by a Kd for gpl20 of about 1.3 nM to about 2.0 nM. The 
binding of gpl20 to gpl20r is inhibited by specific carbohydrates, such as mannose and 

30 fucose, plant lectins such as concanavalin A and specific antibiotics, such as pradimicin A. 

In one embodiment of the present invention, a cDNA molecule that transcribes an 
nvRNA encoding for gpl20r is cloned and expressed to produce gpl20r. The DNA is 
selected from a gene library obtained from tissue such as placenta, brain, muscle and 
colon. 

35 A method of inhibiting HTV infection of mammalian cells, such as brain, muscle 

and neural cells, is contemplated by the present invention. In this method, cells are 
contacted with an effective amount of an appropriate inhibitor of gpl20r binding for a time 
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^G^Tmustrates expression cloning of the gpl20r cDNA and comparison to 

Autoradiography of gpl20 binding to gpl20r and ^ in 
cells. A-F^vgpl20;A,gp^ 
200 nM unlabelled bgpl20; D, CD4; E, CD4 with G17-2 F CD4 wtfh 
bgpl20. G-L [^ngp^O; G, gpl02r, H, gpl20r wtfh 110.1; I, gpl20r 
with bgpl20; J, CD4: k, CD4 with 110.1; L, CD4 with bgP™- 
B: inhibition of [^vgpttO binding to gpl20r and CD4. A-F^0randG- 
L CD4. A+G, HIV antisera (1:20; Trimar); B+H, D-galactose (100 n^, 
C+I, D-mannose (100 mM); D+J, L-tucose (100 mM); E+K, 
Concanavalin A (1 mg/ml); F+L, pradimicin A (100 /xg/ml). 
C: gpl20r binding of HIV. A , HTV; B, HTV with 200 nM bgpl20. 

FIGURE 2 illustrates the characterization of the gpl20r. 

A . scatchard analysis of [^120 binding. A - A, vgp!20 bmdmg to 
placenta, Kd 1.3 nM, IW 19 fmol/mg protein; ■ with ^ ?17-2; 
• V gpl20 binding to gpl20r COS cells, Kd 1.7 nM, IW 150,000 
ie U»tors/cell(R/C);O,ngpl20,Kd 1.8 nM, 149,000R/C 

B* Inhibition of [ 125 I]gpl20 binding to gpl20r COS cells. Open symbols 
ngP 120, filled symbols vgpl20. The relative values were the same^th 
both forms of gpl20. Mannan expressed as mg/ml. □, — ^50 6 
ug/ml); •, L-fucose (K^mM); A, a-methyl D-mannoside (K- 15 mM), 
O, D-mannose 23 mM); O, N-acetylglucosamine (K. 70 mM), ■, 
FXJTA^jO.S mM). 
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C: Internalization of gpl20 by gpl20r COS cells. Points represent the mean of 
two experiments with vgpl20 and ngpl20. • - #, suface; O - O internal. 

D: Placenta control sera; 2, placenta HTV sera; 3, gpl20r COS control sera; 4, 
gpl20r COS HTV sera. 

E: Northern blot of gpl20r expression. Polyadenylated (A + ); 2, placenta; 3, 
thymus; 4+12, forebrain; 5, skeletal muscle; 6, heart; 7, liver; 8, kidney; 
9, colon; 10 medulla; 11, cerebellum; 13, T cell (CEM; 16 /ig A + ) 14, B 
cell (TS-1; 16 /ig A + ); 15, macrophage (U937; 8 /ig A+); 16, cervical 
carcinoma (HeLa; 16 ng A + ). The different apparent size of the ~5 kb 
band is an artifact of displacement by 28S rRNA. 

FIGURE 3 illustrates the sequence analysis of the gpl20r. 

A: Nucleotide and deduced protein sequence of gpl20r cDNA. 

B: Hydropathicity plot of the gpl20r. The predicted transmembrane segment 

and the start of the eight amphipathic repeats are indicated by arrows. 
C: Aminoacid alignment of the gpl20r C-type lectin domain. 



HTV infection of brain and muscle cell lines is not blocked by soluble CD4 or and- 
CD4 antibodies (Clapham, P.R. et al., (1989) Nature 222:368-370; Harouse, J.M. et al., 
(1989) J. Virol. &:2527-2533; Weber, J. et al., (1989) J. Gen. ViroL 7Q:2653-2660). 
This is consistent with the existence of a second gpl20 receptor. Binding studies indicated 
that human placenta was another source for a non-CD4 gpl20 receptor, and a cDNA for a 
second gpl20 receptor (gpl20r) was isolated by the present invention from a placental 
library. The gpl20r has a higher binding affinity for gpl20 than CD4. Sequence analysis 
revealed homology to membrane associated C-type lectins, and inhibition studies have 
shown that the receptor binds gpl20 through a mannose or fucose containing carbohydrate. 
The gpl20r rapidly internalizes gp!20, and is expressed in placenta, thymus, muscle, and 
colon. These results, when considered with previous studies on the role of gpl20 
carbohydrate in HTV infection (Lifson, J. et al., (1986) J. Exp. Med. 164:2101-2106: 
Ezekowitz, R.A.B. et al., (1989) J. Exp. Med. 1^2:185-196; Larkin M. et al., (1989) 
AIDS 2: 793-798; Tanabe-Tochikura A. et al., (1990) Virology 126:473-476), suggest a 
potential role for the gpl20r in HIV infection or pathology. 

The present invention demonstrates that the gpl20r participates in cellular binding 
of HIV by a non-CD4 pathway in muscle and brain, as well as, facilitating virus 
attachment in CD4 positive cell types. It is likely that the gpl20r plays a significant role in 
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transplacental transport of HIV (Zacher, V. et aL, (1991) J. Virol. £5:2102-2107) and 
colon infection. (Barnett, S.W. et al. (1991) Virol. l£2:802-809). Gpl20 produces an 
increase in intracellular calcium in rat retinal ganglion cells (Dreyer, E.B. et aL, (1990) 
Science 24£:364-367) suggesting that the gpl20r or a homologous protein may have 
5 signaling functions in the nervous system disrupted by gpl20 leading to HIV neurotoxicity. 

In the present invention, a new non-CD4 binding protein, or receptor, for gpl20 
was isolated. The HIV surface protein gpl20 was found to bind to a receptor on human 
placental membranes that was not blocked by antibodies directed against CD4, such as 
G17-2 and OKT4a, and which interfere with gpl20 binding to CD4. A cDNA encoding 
10 this receptor was isolated from a placental cDNA library in a mammalian expression vector 
(pCDM8). The gene products were expressed in COS cells and were screened by 125 I- 
labelled gpl20 binding. From a pool of 90,000 cDNA molecules, a single clone was 
isolated that encoded a protein which bound gpl20, even in the presence of concentrations 
of anti-CD4 antibody (G17-2) which completely blocked gpl20 binding to GD4. 
15 Sequence studies were carried out and indicated that the 1.5 kQobase cDNA clone 

encoded a previously unknown member of a family of Type II membrane proteins with an 
extracellular C type lectin domain. 

The cloned gpl20r of the present invention binds gpl20 with an affinity (Kd) of 
about 1 to 2 nM, which is considerably greater than the affinity of CD4 for gpl20 (about 
Kd = 4 nM). 

The binding of gpl20 to gpl20r is not blocked by polyclonal HIV antisera, but is 
inhibited by mannose carbohydrates, fucose carbohydrates, plant lectins such as 
concanavalin A and pradimicin A antibiotics. Other sugars such as N-acetyl-d-glucosamine 
and galactose are less potent inhibitors. 

The gpl20r is expressed on many mammalian cells which do not exhibit high levels 
of GD4, such as placenta, skeletal muscle, brain, and mucosal cells. Other tissue and cells 
displaying gpl20r include colon, thymus, heart, T cells, B cells and macrophages. The 
distribution of tissue having gpl20r parallels that for binding of gpl20 which is not 
blocked by CD4 antibodies, and for HIV infection which is not neutralized by soluble 
CD4. This observation suggests a role for gpl20r in viral infection. 

In gpl20r expressing transfected COS cells, gpl20 is rapidly internalized following 
binding to gpl20r. This binding and internalization of gpl20 is inhibited by compounds 
such as mannan, concanavalin A and pradimicin A. 

In the present invention a cDNA which encodes gpl20 was isolated and cloned. A 
DNA molecule of the present invention corresponds to a complementary DNA molecule 
which transcribes a messenger RNA (mRNA) molecule which, when translated, encodes 
gpl20r. The cDNA molecules were obtained by reverse-transcribing mRNA molecules 
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isolated from mammalian tissue such as placenta, colon, brain or thymus. The 
transcription and cloning of cDNA molecules and isolation of gene products are techniques 
well known in the art and, for example, are described in Sambrook et al., " Molecular 
Cloning: A Laboratory Manual " r 2d edition, Cold Spring Harbor Lab,, Cold Spring 
Harbor, NY (1989), which is incorporated herein by reference. 

As used herein, the phrases "physiologically tolerable" and "pharmaceutical^ 
acceptable 19 refer to molecular entities and compositions that do not produce an allergic or 
similar untoward reaction, such as gastric upset, dizziness and the like, when administered 
to a mammal. Hie physiologically tolerable carrier may take a wide variety of forms 
depending upon the preparation desired for administration and the intended route of 
administration, 

A carrier is a material useful for administering the active compound and must be 
"acceptable" in the sense of being compatible with the other ingredients of the composition 
and not deleterious to the recipient thereof. 

The pharmaceutical compositions are prepared by any of the methods well known in 
the art of pharmacy all of which involve bringing into association the active compound and 
the carrier therefor. 

For therapeutic use, the agent utilized in the present invention can be administered 
in the form of conventional pharmaceutical compositions. Such compositions can be 
formulated so as to be suitable for oral or parenteral administration, or as suppositories. In 
these compositions, the agent is typically dissolved or .dispersed in a physiologically 
tolerable carrier. 

As an example, the compounds of the present invention can be utilized in liquid 
compositions such as sterile suspensions or solutions, or as isotonic preparations containing 
suitable preservatives. Particularly well suited for the present purposes are injectable 
media constituted by aqueous injectable isotonic and sterile saline or glucose solutions. 
Additional liquid forms in which the present compounds may be incorporated for 
administration include flavored emulsions with edible oils such as cottonseed oil, sesame 
oil, coconut oil, peanut oil, and the like, as well as elixirs and similar pharmaceutical 
vehicles. 

The present agents can also be administered in the form of liposomes. As is known 
in the art, liposomes are generally derived from phospholipids or other lipid substances. 
Liposomes are formed by mono- or multilamellar hydrated liquid crystals that are 
dispersed in an aqueous medium. Any non-toxic, physiologically acceptable and 
metabolizable lipid capable of forming liposomes can be used. The present compositions 
in liposome form can contain, in addition to the agent of the present invention, stabilizers, 
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presexvatives, e*F« - *• ^ "* * 
phosphatidyl choline (ledthtos), both natural and synthetic. _^ 

Mefl,ods«,fonn liposomes^ known to *e an. See for exampte. Pre^ Ed , 

■t,^. Biology-. Vohime XIV, Academic Press, Now York, N.Y. (W P 

" ^ Tie present compounds am also be used in compositions such as tablets or piUs 

mgredieat) b mixea ..^ dicalctam phosphate, gums or 

sucrose, sorbitol, talc, stearic acra, mag» The tablets or pills of the 

similar materials as non-toric, physiologically tollable earners The MM> * 
jresea, compositions can be lamtoated or otherwise compounded to pttmde umt dosa^ 
fnnns affording prolonged or delayed action. . . Hi-rts the 

ft shook! be understood that in addition to the aforementioned earner 

^ •SrSTiSfS a^ be provided witi, a. enteric ^ ~ - - 

envdopethat serves to resist oirintegration in fte aomach ^ 

ST « ^s I-"—'- shenae, sheuac and =^^~ 

ace**, and the .ike. A parody snhable enteric coating conm-« ^ 

aeid copolymer togemer wim known materials that eontnbute to the entenc properties 
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A diagnostic method is also described in the present invention for detecting the 
presence, and preferably the amount, of HTV present in a fluid sample by producing a 
reaction product containing HIV bound to gpl20r. Those skilled in the art will recognize 
that there are well known clinical diagnostic procedures that can be utilized for the 
5 formulation and detection of such reaction products. Thus, while exemplary assay methods 
are described herein, the invention is not intended to be so limited. 

Various heterogeneous and homogeneous assay protocols can be employed for 
detecting the presence, and preferably the amount, of HTV in a fluid sample. For example, 
the present invention contemplates a method for assaying a sample, such as a body fluid, 
10 for the presence of HTV comprising the steps of: 

(a) admixing a fluid sample with gpl20r, either in solution or affixed to a solid 
matrix; 

(b) maintaining the admixture for a predetermined time period such as about 10 
minutes to about 16 - 20 hours and under biological assay conditions at a 

15 temperature of about 4°C to about 45°C that is sufficient for any HTV 

present in the sample to react with (bind) the gpl20r to form a reaction 
product; and 

(c) determining the presence of any reaction product that is formed, and thereby 
the presence of any HTV in the admixture. 

20 Preferably, the fluid sample is a body fluid sample, such as blood, plasma, serum, 

urine, saliva, semen or cerebrospinal fluid (CSF). 

The determination of the presence of a reaction product, either directly or 
indirectly, can be accomplished by assay techniques well known in the art such as by the 
use of an indicating or labelling means, as discussed hereinbelow. In a preferred 

25 embodiment, a labelled indicating means, such as a fluorescein-labelled antibody, is 
capable of binding to the gpl20r present in the reaction product to form a labelled 
complex. Determining the presence of the labelled complex provides an assay for the 
presence of HIV in the sample. In particularly preferred embodiments, the amount of 
labelled indicating means bound as part of the complex is determined, and thereby the 

30 amount of HIV present in the sample is determined. When that amount is zero, no HIV is 
present in the sample, within the limits of detection. Methods for assaying the presence 
and amount of a labelled indicating means depend on the label used, such labels and assay 
methods being well known in the art 

In a preferred embodiment, the gpl20r is affixed on a solid matrix to form a solid 

35 phase support. In that embodiment, the assay is heterogeneous, solid/liquid phase assay 
and, as such, has its own preferred manipulations. For example, following admixing of a 
liquid sample with a solid support containing gpl20r affixed thereto, the admixture is 
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also a detection means. In one embodiment, the package can contain a microliter plate 
well to which microgram quantities of gpl2Qr have been operatively affixed, ie., linked so 
as to be capable of reacting with and bind HIV and/or gpl20. 

As used herein, the terms "label" "indicating means" and "labelled indicating 
5 means", in their various grammatical forms refer to single atoms and molecules that are 
either directly or indirectly involved in the production of a detectable signal to indicate or 
detect the presence of a reaction product. Such labels are themselves well known in 
clinical diagnostic chemistry and constitute a part of this invention only insofar as they are 
utilized with otherwise novel methods and/or systems. 

10 The indicating means can be a fluorescent labelling agent that chemically binds to 

antibodies or protein antigens without denaturing them to form a fluorochrome (dye) that is 
a useful immunofluorescent tracer. Suitable fluorescent labelling agents are fluorochrome, 
such as fluorescein isocyanate (FIC), fluorescein isothiocyanate (FTTC), 5-dimethylamine- 
1-naphthalene sulfonyl chloride (DANSC), tetramethykhodamine isocyanate (TRITC), 

15 lissamine and the like. Immunofluorescence analysis techniques are well known in the art, 
and for example, is described in DeLuca, "Immunofluorescence Analysis" in 
Immunofluorescence Analysis . Marchalonis et al., (1982) eds., John Wiley & Sons, Ltd., 
pp. 189-231, which is incorporated herein by reference. 

Other preferred indicating means are colorimetric agents and enzymes, such as 

20 horseradish peroxidase, glucose oxidase or the like, linked as described above, as well as 
radioactive elements, preferably an element mat produces gamma ray emissions. Elements 
which emit gamma rays, , such as 124 I, 125 I, 128 I, 132 I, and 51 Cr •represent one class of 
radioactive indicating groups. Another group of useful labelling means are those elements 
such as ^C, 18 F, 15 0 and 13 N which emit positrons. The positrons so emitted produce 

25 gamma rays upon interaction with electrons present. 

Having generally described this invention, a further understanding can be obtained 
by reference to certain specific examples which are provided herein for purposes of 
illustration only and are not intended to be limiting unless otherwise specified. 

30 EXAMPLE 1 

Cloninp and Isolation of Non-CD4 Onl40 Receptor Protein 

Human placental membranes were found to be able to bind vaccinia derived 
recombinant gpl20 (vgpl20) with a Kd of 1.3 nM. At nM (concentrations) of gpl20 none 
35 of this binding was inhibited by an antibody (G17-2) which has been reported to efficiently 
block gpl20 binding to CD4 (Linsley et al. (1988) J. Virol. ^:3695-3702), as shown in 
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FIGURE 2A. Approximately 50 - 90% of the total placental gpl20 binding was not due to 
CD4. 

A placental cDNA library was obtained in the mammalian expression vector 
pCDM8 and was screened. A cDNA was isolated which expressed protein that exhibited 
high affinity binding for vgpl20 in the presence of G17-2. 

This protein, designated as gpl20 receptor (gpl20r), also bound native gpl20 
(ngpl20), and the binding component was precipitated in the presence of an antibody 
directed against gpl20. 

EXAMPLE 2 



The binding of radiolabelled gpl20 to gpl20r expressed in COS-7 cells was 
studied. Pools of 90,000 cDNA molecules, obtained from a placental pCDM8 library, 

15 were transfected by electroporation into COS-7 cells. Cells which expressed gpl20r on^the 
surface was identified by screening with either 1 nM of I-labelled vgpl20 ( I- 
vgpl20) or ^-ngpttO by the method described in Kozlowski et al., (1990) Antivir. 
Chem. Chemother. 1:175-182, incorporated herein by reference. The results of binding 
studies utilizing the transfected COS-7 cells are shown in FIGURE 1. 

20 Binding of labelled gpl20 (1 nM) to the cells was carried out following a 1 hour 

preincubation of the cells or GP120 at 22°C with one or more of the following: anti-CD4 
antibody G17-2 (5 ug/ml), baculovirus-derived gpl20 (bgpl20, American Biotechnologies, 
200 nM), anti-gpl20 monoclonal antibody 110.1 (25 pglmL), D-mannose (100 mM), D- 
galactose (100 mM), L-fucose (100 mM), concanavalin A (1 mg/ml) or pradimicin A (100 

25 ug/ml). The cells were monitored after autoradiography (3 days). The results seen in 
FIGURES 1 (A and B) illustrate that gpl20 binding to the gpl20r expressed on the cells 
was blocked by excess bgpl20, mannose, fucose, pradimicin A, Concanavalin A, and 
preincubation with antibody 110.1 but not by CD4, antibody G17-2, galactose, or HIV 
antisera. Studies were also carried out on gpl20 binding to CD4 expressing COS cells, 

30 transfected with n H3MCD4 by the method of Peterson et al. (1988) Cell 54:65-72. 

Control studies of the binding of 125 I-labelled psoralen-UV inactivated FHV-BRU 
to the gpl20r expressing COS-7 cells demonstrated binding of HTV to gpl20r and blockage 
by excess bgpl20 (FIGURE 1C). A tabular compilation quantitating the amount of bound 
material to the cells in FIGURE 1 is shown in Table. 1 
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30 Scatchard plots of gpl20 binding to placental membranes and to COS cells 

expressing the gpl20r were carried out in the presence and absence of a 200 fold excess of 
bgpl20 or ngpl20. The results, shown in FIGURE 2A, demonstrate a specific binding of 
vgpl20 to gpl20r with a Kd of 1.7 nM ± 0.4 (n=4) and of ngpl20 to gpl20r with Kd of 
1.8 nM ± 0.2 (n=4), with 150,000 and 149,000 receptors per cell, respectively. 

35 Concurrent analysis of gpl20 binding to CD4 expressed on COS cells gave a Kd of 4-5 nM 
in agreement with previous reports (Iinsley, P,S. et al. (1988) J. Virol 62:3695-3702; 
Schnittman, et al. (1988) J. Immunol. 141:4181-4186). Calculations from the association 
and dissociation rate constants gave a similar comparative result. The expressed gpl20r 
has a relative molecular mass (Mr) of "48,500 and a protein of similar size was also 

40 partially purified from placental membranes (FIGURE 2D). 

The placental membranes and COS cells were surface iodinated, and treated with 1 
nM unlabelled vgpl20, then washed with Blotto RPMI, 5% BSA, 1% Non-fat dry milk, 
0.2% sodium azide solubilized in Triton X-100 (1% in PBS with a protein inhibitor 
cocktail, PMSF, Pepstatin A, orthophenathroline and leupeptin) and immunoprecipitated 



BNSDOCID: <WO_9301820A2J_> 



-14- 



with HIV or control human sera, according to the method described in Curtis et al. (1990) 

J. Immunol. 144:1295-1303. 

Northern analysis of the expression of the gpl20r RNA indicated a major species of 
- 5 kb and a minor species of '1.7 Id) which may represent an alternatively processed 
5 transcript and is more consistent with the size of the gpl20r cDNA. RNA was denatured, 
separated in an agarose gel, transferred to nitrocellulose, hybridized to gpl20r cDNA and 

autoradiographed for 3 days. 

Expression of gpl20r RNA was highest in colon followed by thymus, placenta, 
heart, skeletal muscle, and was not detected in liver or kidney. Low levels of expression 
10 in brain, T cell, B cell, and macrophage (FIGURE 2E) require verification by polymerase 
chain reaction <PCR). Full length CD4 RNA was highest in thymus, T cell, and 
macrophage followed by placenta and colon (not shown). 

The gpl20r cDNA encodes a protein of 404 amino acids with a calculated Mr of 

45,775 (FIGURE 3A). 

15 Sequencing of both strands of gpl20r cDNA was carried out by the dideoxy chain 

termination method. The nucleotide sequence proceeding the first ATG agrees with the 
Kozak consensus. The predicted cytoplasmic domain has a similar length and shows some 
sequence homology to other type E membrane protein C-type lectins (Spiess, M. (1990) 
Biochemistry 29_: 10009-10018). The membrane spanning sequence is underlined and was 

20 predicted in part by homology to related sequences in FIGURE 3C. The potential N- 
Enked glycosylation site is marked by an asterisk. The start of the seven complete and 
eighth partial tandem repeats are indicated (R1-R8). The consensus repeat sequence is 
IYQELT(R/Q) LKAAVGELPEKSKLQE. The beginning of the lectin domains is also 
indicated (L). No signal sequence was apparent but instead demonstrated homology to a 

25 family of Type E membrane proteins which utilize a "20 residue hydrophobic stop-transfer 
sequence for membrane translocation. The "positive inside rule" (von Heijne, G. et al. 
(1988) Eur. J. Biochem. 174:671-678) for the sequence within fifteen residues of the 
transmembrane region predicts a cytoplasmic amino terminus in agreement with the 
homology to membrane associated C-type lectins with similar membrane orientation 

30 (FIGURE 3Q (Spiess, M. (1990) Biochemistry 22:10009-10018). This region, Met 1 to 
Ala 76, represents the first domain of the gpl20r sequence. 

The second domain (lie 77 to Val 249) consists of tandem repeats of nearly 
identical sequence (FIGURE 3A). This region was predicted to consist of a series of 
amphipathic a-helices interrupted by B-tums. Circular Dichroism spectra in 40% 

35 trifluoroethanol of a consensus repeat peptide beginning with the B-turn, 
PEKSKLQEIYQELTQLKAAVGEL (single-letter arnino-acid code), demonstrated an all a- 
hetical structure (not shown). Homology to other repeat domains suggested three possible 
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tertiary structures, (1) antiparallel helix bundles, (2) a multimeric parallel helix bundle, and 
(3) a membrane pore with a hydrophobic exterior and a negatively charged interior. The 
first two models would function as spacers to separate the lectin domain from the 
membrane, while the third could generate a transmembrane signal after ligand binding. 
5 The third domain (Cys 253 to Ala 404) is homologous to the other known C-type 

lectins which are type II membrane proteins (FIGURE 3C). With the exception of the 
IgEr, these lectins bind terminal D-galactose and D-N-acetylgalactosamine of glycoproteins 
(Spiess, M. (1990) Biochemistry 22: 10009-10018). 

The most closely related sequences were the group of Type II membrane protein C- 

10 type lectins: Chick hepatic lectin (CHL) (Drickamer, KJ. (1981) Biol. Chem. 256:5827- 
5839), low affinity IgE receptor (IgEr) (Kikutani, H. et al. (1986) Cell 42: 657-665), the 
asialoglycoptorrin receptors (human HI and H2 (Spiess, M. et al. (1985) Proc. Natl. 
Acad. Sci. USA £2:6465-6569) are shown), and the rat Kupffer cell receptor (Hoyle, 
G.W. et al. (1988) J. Biol. Chem. 263:7487-7492^, The most similar mannose binding 

15 lectin was one of the eight carbohydrate recognition domains of the human macrophage 
mannose receptor (Mannr) (Taylor, M.E. et al. (1990) J. Biol. Chem. 265:12156-12162; 
Ezekowitz, R.A.B. et al. (1990) J. Exp. Med. 172:1785-1794^. Residues iden^vto thez 
gpl20r are boxed. ALIGN scores indicate significant sequence similarity if gf^t» : dianv 
3.0. The complete gpl20r sequence was most homologous to the Kupffer cell receptor 

20 which has a similar tandem repeat (Hoyle, G.W. et al. (1988) J. BioL Chem. 262:7487- 
7492). 

The inability to crosslink gpl20 to the non-CD4 sites on placenta and brain cell 
lines (not shown) was consistent with an interaction of the gpl20r with carbohydrate, and 
polyclonal HTV antisera added to gpl20 blocked binding to CD4 but not to the gpl20r 

25 (FIGURE IB). Galactose and N-acetylgalactosamine did not block gpl20 binding, but 
mannose and fucose completely blocked binding to the gpl20r without an effect on CD4 
(FIGURE IB). Inhibition by a series of sugars is shown in FIGURE 2B. Human IgE (10 
/zg/ml), sialic acid (100 mM), and mannose-6-phosphate (100 raM) had no effect on 
binding to the gpl20r. The three forms of gpl20 used have different oligosaccaride 

30 structures. Bgpl20 contains only high mannose structures (Hsieh, P. et al. (1984) J. Biol. 
Chem. 252:2375-2382). Vgpl20 has equal proportions of high mannose and complex 
(Mizuochi, T. et al. (1988) Biochem. J. 254:599-603^ similar to ngpl20 which has a 
greater structural diversity in the complex chains (Geyer, H. et al. (1988) J. Biol. Chem. 
2^2:11760-11767; Mizuochi, T. et al. (1990) J. Biol. Chem. 2£5i85 19-8524). The 

35 affinity of the gpl20r for all three forms was similar (FIGURE 2A ) suggesting that the 
terminal mannose of high mannose chains are the primary determinants of binding. As 
expected for a C-type lectin the gpl20r required calcium and binding was blocked by 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Curtis, Benson 

(ii) TITLE OF INVENTION: INHIBITION OF NON-CD4 MEDIATED 

HIV INFECTION 

(iii) NUMBER OF SEQUENCES: 9 

(iv) CORRESPONDENCE ADDRESS: 



(A) 


ADDRESSEE: 


Bristol-Myers Squibb Company 


(B) 


STREET: 


3005 First Avenue 


(Q 


CITY: 


Seattle 


(D) 


STATE: 


Washington 


(E) 


COUNTRY: 


USA 


(F) 


ZIP: 


98121 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 
(Q OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patenttn Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US UNKNOWN 

(B) FILING DATE: ll-JUL-1991 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) ! NAME: Sorrentino, Joseph M. 

(B) REGISTRATION NUMBER: 32,598 

(C) REFERENCE/DOCKET NUMBER: ON0086- 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (206) 728-4800 

(B) TELEFAX: (206) 448-4775 
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(2) 

ft 



(ii) 
(vi) 



INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS : 



(A) LENGTH: 

(B) TYPE: 

(C) STRANDEDNESS: 

(D) TOPOLOGY: 

MOLECULE TYPE: cDNA 
ORIGINAL SOURCE: 
(A) ORGANISM: 
FEATURE: 



1312 base pairs 
nucleic acid 
double 
linear 



Human immunodeficiency virus type 1 



(A) NAME/KEY: CDS 

(B) LOCATION: 42.. 1253 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

CTAAAGCAGG AGTTCTGGAC ACTGGGGGAG AGTGGGGTGA C ATG AGT GAC TCC 53 

Met Ser Asp Ser 
1 

AAG GAA CCA AGA CTG CAG CAG CTG GGC CTC CTG GAG GAG GAA GAG CTG 101 
Lys Glu Pro Arg Leu Gin Gin Leu Gly Leu Leu Glu Glu Glu Gin Leu 
5 10 15 20 

AGA GGC CTT GGA TTC CGA CAG ACT CGA GGA TAC AAG AGC TTA GCA GGG 149 
Arg Gly Leu Gly Phe Arg Gin Thr Arg Gly Tyr Lys Ser Leu Ala Gly 
25 30 35 

TGT CTT GGC CAT GGT CCC CTG GTG CTG CAA CTC CTC TCC TTC ACG CTC 197 
Cys Leu Gly His Gly Pro Leu Val Leu Gin Leu Leu Ser Phe Thr Leu 
40 45 50 

TTG GCT GGG CTC CTT GTC CAA GTG TCC AAG GTC CCC AGC TCC ATA AGT 245 
Leu Ala Gly Leu Leu Val Gin Val Ser Lys Val Pro Ser Ser He Ser 
55 60 65 

CAG GAA CAA TCC AGG CAA GAC GCG ATC TAC CAG AAC CTG ACC CAG CTT 293 
Gin Glu Gin Ser Arg Gin Asp Ala He Tyr Gin Asn Leu Thr Gin Leu 
70 75 80 

AAA GCT GCA GTG GGT GAG CTC TCA GAG AAA TCC AAG CTG CAG GAG ATC 341 
Lys Ala Ala Val Gly Glu Leu Ser Glu Lys Ser Lys Leu Gin Glu He 
85 90 95 100 
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TAC CAG GAG CTG ACC CAG CTG AAG GCT GCA GTG GGT GAG CTT CCA GAG 389 
Tyr Gin Glu Leu Thr Gin Leu Lys Ala Ala Val Gly Glu Leu Pro Glu 
105 110 115 

AAA TCT AAG CTG CAG GAG ATC TAC CAG GAG CTG ACC CGG CTG AAG GCT 437 
Lys Ser Lys Leu Gin Glu lie Tyr Gin Glu Leu Thr Arg Leu Lys Ala 
120 125 130 

GCA GTG GGT GAG CTT CCA GAG AAA TCT AAG CTG CAG GAG ATC TAC CAG 485 
Ala Val Gly Glu Leu Pro Glu Lys Ser Lys Leu Gin Glu lie Tyr Gin 
135 140 145 

GAG CTG ACC TGG CTG AAG GCT GCA GTG GGT GAG CTT CCA GAG AAA TCT 533 
Glu Leu Thr Trp Leu Lys Ala Ala Val Gly Glu Leu Pro Glu Lys Ser 
150 155 160 

AAG ATG CAG GAG ATC TAC CAG GAG CTG ACT CGG CTG AAG GCT GCA GTG 581 
Lys Met Gin Glu lie Tyr Gin Glu Leu Thr Arg Leu Lys Ala Ala Val 
165 170 175 180 

GGT GAG CTT CCA GAG AAA TCT AAG CAG CAG GAG ATC TAC CAG GAG CTG 629 
Gly Glu Leu Pro Glu Ly6 Ser Lys Gin Gin Glu lie Tyr Gin Glu Leu 
185 190 195 

ACC CGG CTG AAG GCT GCA GTG GGT GAG CTT CCA GAG AAA TCT AAG CAG 677 
Thr Arg Leu Lys Ala Ala Val Gly Glu Leu Pro Glu Lys Ser Lys Gin 
200 205 210 

CAG GAG ATC TAC CAG GAG CTG ACC CGG CTG AAG GCT GCA GTG GGT GAG 725 
Gin Glu He Tyr Gin Glu Leu Thr Arg Leu Lys Ala Ala Val Gly Glu 
215 " 220 225 

CTT CCA GAG AAA TCT AAG CAG CAG GAG ATC TAC CAG GAG CTG ACC CAG 773 
Leu Pro Glu Lys Ser Lys Gin Gin Glu He Tyr Gin Glu Leu Thr Gin 
230 235 240 

CTG AAG GCT GCA GTG GAA CGC CTG TGC CAC CCC TGT CCC TGG GAA TGG 821 
Leu Lys Ala Ala Val Glu Arg Leu Cys His Pro Cys Pro Trp Glu Trp 
245 ~ 250 255 260 

ACA TTC TTC CAA GGA AAC TGT TAC TTC ATG TCT AAC TCC CAG CGG AAC 869 
Thr Phe Phe Gin Gly Asn Cys Tyr Phe Met Ser Asn Ser Gin Arg Aen 
265 270 275 

TGG CAC GAC TCC ATC ACC GCC TGC AAA GAA GTG GGG GCC CAG CTC GTC 917 
Trp His Asp Ser He Thr Ala Cys Lys Glu Val Gly Ala Gin Leu Val 
280 285 290 

GTA ATC AAA AGT GCT GAG GAG CAG AAC TTC CTA CAG CTG CAG TCT TCC 965 
Val He Lys Ser Ala Glu Glu Gin Asn Phe Leu Gin Leu Gin Ser Ser 
295 300 305 
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AGA AGT AAC CGC TTC ACC TGG ATG GGA CTT TCA GAT CTA AAT CAG GAA 1013 
Arg Ser Asn Arg Phe Thr Trp Met: Gly Leu Ser Asp Leu Asn Gin Glu 
310 . " 315 320 

GGC ACG TGG CAA TGG GTG GAC GGC TCA CCT CTG TTG CCC AGO TTC AAG 1061 
Gly Thr Trp Gin Trp Val Asp Gly Ser Pro Leu Leu Pro Ser Phe Lys 
325 330 335 340 

CAG TAT TGG AAC AGA GGA GAG CCC - AAC AAC GTT GGG GAG GAA GAC TGC 1109 
Gin Tyr Trp Asn Arg Gly Glu Pro Asn Asn Val Gly Glu Glu Asp Cys 
345 350 355 

GCG GAA TTT AGT GGC AAT GGC TGG AAC GAC GAC AAA TGT AAT CTT GCC 1157 
Ala Glu Phe Ser Gly Asn Gly Trp Asn Asp Asp Lys Cys Asn Leu Ala 
360 365 370 

AAA TTC TGG ATC TGC AAA AAG TCC GCA GCC TCC TGC TCC AGG GAT GAA 1205 
Lys Phe Trp lie Cys Lys Lys Ser Ala Ala Ser Cys Ser Arg Asp Glu 
375 380 385 

GAA CAG TTT CTT TCT CCA GCC CCT GCC ACC CCA AAC CCC CCT CCT GCG 1253 
Glu Gin Phe Leu Ser Pro Ala Pro Ala Thr Pro Asn Pro Pro Pro Ala 
390 395 400 

TAGCAGAACT TCACCCCCTT TTAAGCTACA GTTCCTTCTC TCCATCCTTC GACCTTTAG 1312 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 404 amino acids \dfr^ bO&VG^ ^ ^jTL 

(B) TYPE: amino acid , ^1 f ^r^^x ^^4^ 
(D) TOPOLOGY: linear / ' 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 



Met Ser Asp Ser Lys Glu Pro Arg Leu Gin Gin Leu Gly Leu Leu Glu 
1 5 10 15 

Glu Glu Gin Leu Arg Gly Leu Gly Phe Arg Gin Thr Arg Gly Tyr Lys 
20 25 30 

Ser Leu Ala Gly Cys Leu Gly His Gly Pro Leu Val Leu Gin Leu Leu 
35 40 45 

Ser Phe Thr Leu Leu Ala Gly Leu Leu Val Gin Val Ser Lys Val Pro 
50 55 60 
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Ser Ser lie Ser Gin Glu Gin Ser Arg Gin Asp Ala lie Tyr Gin Asn 
65 70 75 80 

Leu Thr Gin Leu Lys Ala Ala Val Gly Glu Leu Ser Glu Lys Ser Lys 
85 90 95 

Leu Gin Glu He Tyr Gin Glu Leu Thr Gin Leu Lys Ala Ala Val Gly 
100 105 HO 

Glu Leu Pro Glu Lys Ser Lys Leu Gin Glu He Tyr Gin Glu Leu Thr 
115 120 125 

Arg Leu Lys Ala Ala Val Gly Glu Leu Pro Glu Lys Ser Lys Leu Gin 
130 135 140 

Glu He Tyr Gin Glu Leu Thr Trp Leu Lys Ala Ala Val Gly Glu Leu 
145 150 155 160 

Pro Glu Lys Ser Lys Met. Gin Glu lie Tyr Gin Glu Leu Thr Arg Leu 
165 170 175 

Lys Ala Ala Val Gly Glu Leu Pro Glu Lys Ser Lys Gin Gin Glu He 
180 185 190 

Tyr Gin Glu Leu Thr Arg Leu Lys Ala Ala Val Gly Glu Leu Pro Glu 
195 200 205 

Lys Ser Lys Gin Gin Glu He Tyr Gin Glu Leu Thr Arg Leu Lys Ala 
210 215 220 

Ala Val Gly Glu Leu Pro Glu Lys Ser Lys Gin Gin Glu He Tyr Gin 
225 230 235 240 

Glu Leu Thr Gin Leu Lys Ala Ala Val Glu Arg Leu Cys His Pro Cys 
245 250 255 

Pro Trp Glu Trp Thr Phe Phe Gin Gly Asn Cys Tyr Phe Met Ser Asn 
260 265 270 

Ser Gin Arg Asn Trp His Asp Ser He Thr Ala Cys Lys Glu Val Gly 
275 280 285 

Ala Gin Leu Val Val lie Lys Ser Ala Glu Glu Gin Asn Phe Leu Gin 
290 295 300 

Leu Gin Ser Ser Arg Ser Asn Arg Phe Thr Trp Met Gly Leu Ser Asp 
305 310 315 320 

Leu Asn Gin Glu Gly Thr Trp Gin Trp Val Asp Gly Ser Pro Leu Leu 
325 330 335 

Pro Ser Phe Lys Gin Tyr Trp Asn Arg Gly Glu Pro Asn Asn Val Gly 
340 345 350 
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355 



Al . xy. Ph. Trp II. cy. Ly. xys Ser U. -r cy. 



375 



Cys Asn Leu 
370 

Ser « *sp OXu OXu Oln Phe Ser Pro £a Pro -a XHr Pro *sn 
385 390 
Pro Pro Pro Ala 

(2 ) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 

(B) TYPE: 
(D) TOPOLOGY: 

(ii) MOLECULE TYPE: 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: 



127 amino acids 
amino acid 
linear 

protein 

internal 



Human immunodeficiency virus type 1 



SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



Cys 



His Pro Cys Pro Trp 



Glu Trp Thr Phe Phe Gin GXy Asn Cys Tyr 



10 



w4« asd ser lie Thr Ala Cys 
Phe Met Ser Asn Ser Gin Arg Asn Trp Hxs Asp ^ 

20 2i> 



Lye OXu VeX SXy cm Leu V.1 V.X lie Ly. Ser *X. OX. «- 
< 35 40 



Asn Phe Leu 
50 



Gin Leu Gin ser Ser Arg Ser Asn Arg 
55 OU 



45 

Phe Thr Trp Met 



« ly Leu s« «P «u *.„ GX» OXu OXy Thr TrP cx„ Trp vaX hep GXy 

70 

pre Leu Leu Pro ser Phe Lys CXn Tyr Trp hen *r= GXy OXu Pro 



65 
Ser 
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Asn Asn val Gly Glu Glu Asp Cys Ala Glu Phe Ser Gly Asn Gly Trp 
100 105 110 

Asn Asp Asp Lys Cys Asn Leu Ala Lys Phe Trp lie Cys Lys Lys 
115 120 . 125 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



Cys Gly Ala Gin Ser Arg Gin Trp Glu Tyr Phe Glu Gly Arg Cys Tyr 
15 10 15 

Tyr Phe Ser Leu Ser Arg Met Ser Trp His Lys Ala Lys Ala Glu Cys 
20 25 30 

Glu Glu Met His Ser His Leu lie lie lie Asp Ser Tyr Ala Lys Gin 
35 40 45 

Asn Phe Val Met Phe Arg Thr Arg Asn Glu Arg Phe Trp lie Gly Leu 
50 55 60 

Thr Asp Glu Asn Gin Glu Gly Glu Trp Gin Trp Val Asp Gly Thr Asp 
65 70 75 80 

Thr Arg ser Ser Phe Thr Phe Trp Lys Glu Gly Glu Pro Asn Asn Arg 
85 90 95 

Gly Phe Asn Glu Asp Cys Ala His Val Trp Thr Ser Gly Gin Trp Asn 
100 105 110 

Asp Val Tyr Cys Thr Tyr Glu Cys Tyr Tyr Val Cys Glu Lys 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 5: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) 



SEQUENCE DESCRIPTION: SEQIDNO: 5: 

Cys Asn Thr Cys Pro Glu Lys Trp He Asn Phe Gin Arg Lys Cys Tyr 



Tyr Phe Gly Lys Gly Thr Lys Gin Trp Val His Ala Arg Tyr Ala Cys 
20 25 30 

Asp Asp Met Glu Gly Gin Leu Val Ser He His Ser Pro Glu Glu Gin 
35 40 45 

Asp Phe Leu Thr Lys His Ala Ser His Thr Gly Ser Trp He Gly Leu 
50 55 60 

Arg Asn Leu Asp Leu Lys Gly Glu Phe He Trp Val Asp Gly Ser His 
65 70 75 

Val Asp Tyr Ser Asn Trp Ala Pro Gly Glu Pro Thr Ser Arg Ser Gin 
85 9° 95 

Gly Glu Asp Cys Val Met Met Arg Gly Ser Gly Arg Trp Asn Asp Ala 
100 105 HO 

Phe cys Asp Arg Lys Leu Gly Ala Trp Val Cys Asp Arg 
115 120 125 

(2) INFORMATION FOR SEQIDNO: 6: 
0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(v) FRAGMENT TYPE: 



internal 



(xi) SEQUENCE DESCRIPTION: SEQIDNO: 6: 



Arg Thr Cys Cys Pro Val Asn Trp Val Glu His Glu Arg Ser Cys Tyr 
15 10 15 

Trp Phe Ser Arg Ser Gly Lys Ala Trp Ala Asp Ala Asp Asn Tyr Cys 
20 25 30 

Arg Leu Glu Asp Ala His Leu Val Val Val Thr Ser Trp Glu Glu Gin 
35 40 45 

Lys Phe Val Gin His His lie Gly Pro Val Asn Thr Trp Met Gly Leu 
50 55 60 

His Asp Gin Asn Gly Pro Trp Lys Trp Val Asp Gly Thr Asp Tyr Glu 
65 70 75 80 

Thr Gly Phe Lys Asn Trp Arg Pro Glu Gin Pro Asp Asp Trp Tyr Gly 
85 90 95 

His Gly Leu Gly Gly Gly Glu Asp Cys Ala His Phe Thr Asp Asp Gly 
100 105 110 

Arg Trp Asn Asp Asp Val Cys Gin Arg Pro Tyr Arg Trp Val Cys Glu 
115 120 125 

Thr 



(2) INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 

(B) TYPE: 

(D) TOPOLOGY: 



129 amino acids 
amino acid 
linear 



(ii) MOLECULE TYPE: 
(v) FRAGMENT TYPE: 



protein 
internal 
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INSCRIPTION: SEQ ID NO: 7: 



(xi) SEQUENCEDESCRIPHON: SEQ ID NO: 7: 

Arg Thr Cys Cys Pro Val Asn Trp Val Glu His Gin Gly Ser Cys Tyr 
1 5 XO - 15 

Trp Phe Ser His Ser Gly Lys Ala Trp Ala Glu Ala Glu Lys Tyr Cys 
20 25 30 

Gin Leu Glu Asn Ala His Leu Val Val He Asn Ser Trp Glu Glu Gin 
35 40 45 

Lys Phe He Val Gin His Thr Asn Pro Phe Asn Thr Trp He Gly Leu 
50 55 60 

Thr Asp Ser Asp Gly Ser Trp Lys Trp Val Asp Gly Thr Asp Tyr Arg 
65 70 75 80 

His Asn Tyr Lys Asn Trp Ala Val Thr Gin Pro Asp Asn Trp His Gly 
85 90 9 

His Glu Leu Gly Gly Ser Glu Asp Cys Val Glu Val Gin Pro Asp Gly 
100 105 HO 

Arg Trp Asn Asp Asp Phe Cys Leu Gin Val Tyr Arg Trp Val Cys Glu 
115 120 125 

Lys 

(2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(u) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Leu Gin Leu lie Met Gin Asp Trp Lys Tyr Phe Asn Gly Lys Phe Tyr 
1 5 io 15 
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Tyr Phe Ser Arg Asp Lys Lys Ser Trp His Glu Ala Glu Asn Phe Cys 
20 25 30 

Val Ser Gin Gly Ala His Leu Ala Ser Val Thr Ser Gin Glu Glu Gin 
35 40 45 

Ala Phe Leu Val Gin He Thr Asn Ala Val Asp His Trp He Gly Leu 
50 55 60 

Thr Asp Gin Gly Thr Glu Gly Asn Trp Arg Trp Val Asp Gly Thr Pro 
65 70 75 80 

Phe Asp Tyr Val Gin Ser Arg Arg Phe Trp Arg Lys Gly Gin Pro Asp 
85 90 95 

Asn Trp Arg His Gly Asn Gly Glu Arg Glu Asp Cys Val His Leu Gin 
100 105 HO 

Arg Met Trp Asn Asp Met Ala Cys Gly Thr Ala Tyr Asn Trp Val Cys 
115 120 125 



Lys Lys 
130 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



Pro Thr His Cys Pro Ser Gin Trp Trp Pro Tyr Ala Gly His Cys Tyr 
! 5 10 15 

Lys He His Arg Asp Glu Lys Lys He Gin Arg Asp Ala Leu Thr Thr 
20 25 30 

Cys Arg Lys Glu Gly Gly Asp Leu Thr Ser He His Thr He Glu Glu 
35 40 45 
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Leu Asp Phe lie He Ser Gin Leu Gly Leu Glu Pro Asn Asp Glu Leu 
• 50 55 60 

Trp He Gly Leu Asn Asp He Lys He Gin Met Tyr Phe Glu Trp Ser 
65 70 75 

Asp Gly Thr Pro Val Thr Phe Thr Lys Trp Leu Arg Gly Glu Pro Ser 
His Glu Asn Asn Arg Gin Glu Asp Cys Val Val Met Lys Gly Lys Asp 
Gly Tyr Trp Ala Asp Arg Gly Cys Glu Trp Pro Leu Gly Tyr He Cys 



115 



120 



Lys Met 
130 
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We claim: 

1. A method of inhibiting HIV infection of mammalian cells comprising contacting the 
cells with an effective amount of a compound selected from the group consisting of 
a mannose carbohydrate, a fucose carbohydrate, a lectin and a drug, for a time 
period sufficient to significantly inhibit the binding of HIV to a non-CD4 cell 
surface protein. 

2. The method of Claim 1, wherein the non-CD4 cell surface protein is a gp!20 
receptor having a specific binding affinity for gpl20 of about Kd = 1.3 nM to 
about Kd = 2.0 nM. 

3. The method of Claim 2, wherein the gp!20 receptor is present on placental cells. 

4. The method of Claim 2, wherein the gpl20 receptor is present on muscle cells. 

5. The method of Claim 2, wherein the gp!20 receptor is present on neural cells. 

6. The method of Claim 5, wherein the neural cells are brain cells. 

7. The method of Claim 5, wherein the neural cells are dendritic cells. 

8. The method of Claim 2, wherein the gpl20 receptor is present on mucosal cells. 
10. The method of Claim 1, wherein the compound is mannose. 

10. The method of Claim 1, wherein the compound is fucose. 

11. The method of Claim 1, wherein the compound is a mannose-containing 
carbohydrate. 

12. The method of Claim 11, where the carbohydrate is mannan. 

13. The method of Claim 1, wherein the compound is a pradimicin A antibiotic. 

14. A substantially purified non-CD4 gpl20 receptor protein comprising a protein 
substantially corresponding to a non-CD4 mammalian cell surface protein that has a 
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specific bJSfaffinity for gpl20, said protein coining about 400 

iidues, having a mo.ecu.ar weigh, of abou, 45,000 daltons and having a bmdmg 

affinity for gpl20 characterized by a Kd of about 1.3 nM to about 2 nM. 

The gpl20 recepmr protein of Oaim .4, wherein the binding of the gp!20 u^eptor 
pnjn to gp!20 is inhibit by a compound selected from me group conashng of a 
mannose carbohydrate, a facose carbohydrate, a lectin and a drug. 

The gpl20 receptor of Claim 15, wherein the compound is mannose. 

The gpl20 rector protein of Chum 15, wherein the compound is a pmdimicin A 
antibiotic. 

The gpl20 receptor protein of Claim 14, wherein the protein is produced by 
recombinant means. 

r rn*im 1R wherein said recombinant means 
The eol20 receptor protein of Claim 18, wnercm ^ 
^pSes theZng of a cDNA isolated ftom a library o, recombmant placenta. 



A DNA molecule encoding the gp.20 receptor protein of Oatm 14, « *e 

DNA is a complementary DNA that transcribes an mRNA found m 

from me group consisting of placental colls, brain cefis, muscte cefis and colon 

cells. 

A method of detecting the presence of HIV in a sample comprising: 

(a) admixing in an aqueous medium a sample to be assayed wxth a non- 
CD4 gpl20 receptor protein having a specific binding affinity for gpl20 
characterized by a Kd of about 1.3 nM to about 2.0 nM in an amount 
sufficient to carry out at least one assay; 

(b) maintaining the admixture for a time period sufficient for the gpl20 
receptor protein to bind to any HTV present in the sample and form a 

reaction product; and . . 

(c) determining the presence of the HTV containing reaction product. 
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22. 

23. 
24. 

25. 
26. 



The method of Claim 21, wherein the gpl20 receptor protein contains about 
400 amino acid residues and has a molecular weight of about 45,000 
daltons. 

The method of Claim 21, wherein the gpl20 receptor protein is affixed to a 
solid matrix to form a solid support. 

The method of Claim 21, wherein the presence of the reaction product is 
determined by contacting the sample with a reagent capable of detecting the 
bound gpl20 receptor protein. 

The method of Claim 24, wherein the reagent is a labelled antibody directed 
against the gpl20 receptor protein. 

A diagnostic system in kit form, for assaying for the presence of HTV in a 
fluid sample, comprising a package containing a non-CD4 receptor protein 
having a specific affinity for gpl20 characterized by a Kd of about 1.3 nM 
to about 2.0 nM, and instructions for use. 

The diagnostic system of Claim 26, wherein the non-CD4 gpl20 receptor 
protein is affixed to a solid matrix to form a solid support. 
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Figure 1A 
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Figure IB 
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Figure 1C 
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Figure 2B 
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Figure 2D 
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Figure 2E 
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1 CTAAAGCAGGAGTTCTGGACACTGGGGGAGAGTGGGGTGAC 

4 2 ATGJ^fcACTCCAAGGAACCAAGACTGCAGCAGq^^GCCTCCTGGAGGAGGAACAGCTG 

1 M SKEPRLQQ I» L L E E E Q L 

102 AGAGGCCTTGGATTCCGACAGACTCGAGGATACAAGAGCTTAGCAGGGTGTCTTGGCCAT 
21 RGLGFRQTRGYKSLAGCLGH 

162 GGTCCCCTGGTGCTGCAACTCCTCTCCTTCACGCTCTTGGCTGGGCTCCTTGTCCAAGTG 
41 GPLVLOLLST-TL LAG L L V Q V 

222 TCCAAGGTCCCCAGCTCCATAAGTCAGGAACAATCCAGGCAAGACGCGATCTACCAGAAC 
61 SKVPSSISQEQSRQDAIYQN 

Rl * 

282 CTGACCCAGCTTAAAGCTGCAGTGGGTGAGCTCTCAGAGAAATCCAAGCTGCAGGAGATC 
81 LTQLKAAVGELSEKSKLQEI 

R2 

342 TACCAGGAGCTGACCCAGCTGAAGGCTGCAGTGGGTGAGCTTCCAGAGAAATCTAAGCTG 
101 YQELTQLKAAVGELPEKSKL 

402 CAGGAGATCTACCAGGAGCTGACCCGGCTGAAGGCTGCAGTGGGTGAGCTTCCAGAGAAA 
121 QEI YQELTRLKAAVGELPEK 

R3 

4 62 TCTAAGCTGCAGGAGATCTACCAGGAGCTGACCTGGCTGAAGGCTGCAGTGGGTGAGCTT 
141 SKLQEIYQELTWLKAAVGEL 

R4 

522 CCAGAGAAATCTAAGATGCAGGAGATCTACCAGGAGCTGACTCGGCTGAAGGCTGCAGTG 
161 PEKSKMQEIYQELTRLKAAV 

R5 

582 GGTGAGCTTCCAGAGAAATCTAAGCAGCAGGAGATCTACCAGGAGCTGACCCGGCTGAAG 
181 GELP EKSKQQEI YQELT RL.K 

R6 

64 2 GCTGCAGTGGGTGAGCTTCCAGAGAAATCTAAGCAGCAGGAGATCTACCAGGAGCTGACC 
201 AAVG ELPEKSKQQEI YQELT 

R7 

702 CGGCTGAAGGCTGCAGTGGGTGAGCTTCCAG AGAAATCTAAGCAGCAGGAGATCTACCAG 
221 RLKAAVGELPEKSKQQEIYQ 

R8 

7 62 GAGCTGACCCAGCTGAAGGCTGCAGTGGAACGCCTGTGCCACCCCTGTCCCTGGGAATGC 
241 ELTQLKAAVERLCHPCPWEW 



822 
261 



ACATTCTTCCAAGGAAACTGTTACTTCATGTCTAACTCCCAGCGGAACTGGCACGACTCC 
TFFQGNCYFMSNSQRNWHDS 



Figure 3A 
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882 ATCACCGCCTGCAAAGAAGTGGGGGCCCAGCTCGTCGTAATCAAAAGTGCTGAGGAGCAG 

281 ITACKEVGAQLVVIKSAE EQ 

942 AACTTCCTACAGCTGCAGTCTTCCAGAAGTAACCGCTTCACCTGGATGGGACTTTCAGAT 

301 NFLQLQSSRSNRFTWMGLSD 

1002 CTAAATCAGGAAGGCACGTGGCAATGGGTGGACGGCTC ACCTCTGTTG CCCAG CTTCAAG 

321 L NQEGTWQWV DGSP LLP S FK 

1062 CAGTATTGGAACAGAGGAGAGCCCAACAACGTTGGGGAGGAAGACTGCGCGGAATTTAGT 

341 QYWNRGEPN NVGEEDCAE FS 

1122 GGCAATGGCTGGAACGACGACAAATGTAATCTTGCCAAATTCTGGATCTGCAAAAAGTCC 

361 GNGWNDDKCNLAKFWICKKS 

1182 GCAGCCTCCTGCTCCAGGGATGAAGAACAGTTTCTTTCTCCAGCCCCTGCCACCCCAAAC 

381 AASCSRDEEQFLSPAPATPN 

1242 CCCCCTCCTGCGTAGCAGAACTTCACCCCCTTTTAAGCTACAGTTCCTTCTCTCCATCCT 

401 P P P A *** 

1302 TCGACCTTTAG 



Figure 3A(cont.) 
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Figure 3B 
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huraan/anlmal body the search has been carried out and based on the alleged 
effects of the compound/composition. 

2 ^ ^ because'ttary relate to parts of the international application that do not comply with the prescribed requirements to such 

an extent that no meaningful international search can be carried out, specifically: 



3 " ^ tecaurc'tay are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 

Please see our Invitation PCT/ISA/206 of 30.11.92 



l. [~| as all required additional search fees were timely paid by the applicant, this international search report covers all 
searchable claims. 

2 - 1 I As ail searchable claims could be searches without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 



3. (in As only some of the required additional search fees were timely paid by the applicant, this international search report 
1 ' covers only those claims for which fees were paid, specifically claims Nos.: 

1-8 (partially), 9-12 
14-27 

4. |~1 No required additional search fees were timely paid by the applicant. Consequently, this international search report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest [Y] The additional search fees were accompanied by the applicant's protest. 

[ | No protest accompanied the payment of additional search fees. 



Form PCT/ISA/210 (continuation of first sheet (I)) (July 1992) 

BNSDOCID: <WO 9301820A3 I > 
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This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 



Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 



U FADED TEXT OR DRAWING 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 



J-^REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 
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